* Sampling Theory & Basic Concepts
* Types of Sampling:
  » random,
  » purposive,
  » stratified,
  » systematic & multistage sampling
* Sampling distribution
* Sampling theory is the study of the relationships
existing between a population and samples
drawn from the population.

* The term sample refers to that part of the
universe which is selected for the purpose of
investigation.
Types of Sampling

Sampling Techniques
* Probability sampling: simple random, stratified, cluster
* Non-probability sampling: convenience, quota
1. Simple Random Sampling

* All subsets of the frame of a given size are given
equal probability of being selected.
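
As a minimal sketch of the idea, assuming the sampling frame is available as a Python list (the roll numbers and the helper name `simple_random_sample` are hypothetical, not from the slides), the standard library can draw a sample in which every subset of the requested size is equally likely:

```python
import random

def simple_random_sample(frame, n, seed=None):
    """Draw n distinct units; every size-n subset of the frame is equally likely."""
    rng = random.Random(seed)      # a seed is used only to make the run repeatable
    return rng.sample(frame, n)    # sampling without replacement

# Hypothetical frame: student roll numbers 1..50
frame = list(range(1, 51))
sample = simple_random_sample(frame, 5, seed=42)
print(sample)
```

The teacher drawing names from a hat in the exercise later on is the physical version of the same procedure.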
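» Equal individual chances alone do not make a simple random sample. Suppose a sample of high school students is selected as follows:

1. Flip a fair coin.
2. If heads, select all female students in the school as the sample.
3. If tails, select all male students in the school as the sample.

Each student has an equal chance to be in the sample, but not every subset of students is equally likely (no sample can mix female and male students), so this is not a simple random sample.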
1. Simple Random Sampling

Advantages:
* Minimal knowledge of population needed
* Easy to analyze data

Disadvantages:
* Low frequency of use
* Does not use researchers' expertise
* Larger risk of random error
2. Systematic Sampling

Systematic sampling is defined as a probability
sampling method where the researcher chooses
elements from a target population by selecting a
random starting point and then selecting sample members
after a fixed "sampling interval".
2. Systematic Sampling

> When you are sampling, ensure you represent
the population fairly.

> Systematic sampling is an ordered process
where the researcher chooses the sample members at
a specifically defined interval.

> Sampling like this leaves the researcher little room
for bias in choosing the sample.
Example: in a gym class, the instructor asks the
students to line up and asks every third person to
step out of the line. Here, the instructor has no
influence over who is chosen, so the sample can
represent the class fairly.
For instance, if a local NGO is seeking to form a
systematic sample of 500 volunteers from a
population of 5000, the sampling interval is
5000 / 500 = 10, so it can pick a random starting
point among the first 10 people and then select every
10th person in the population.
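
The NGO example can be sketched in a few lines, assuming the population is held in an ordered list (the volunteer IDs and the function name `systematic_sample` are made up for illustration):

```python
import random

def systematic_sample(population, n, seed=None):
    """Pick a random start within the first interval, then every k-th unit."""
    k = len(population) // n          # sampling interval, e.g. 5000 // 500 = 10
    rng = random.Random(seed)
    start = rng.randrange(k)          # random starting index in 0..k-1
    return population[start::k][:n]   # every k-th element from the start

# Hypothetical population of 5000 volunteer IDs
volunteers = list(range(1, 5001))
sample = systematic_sample(volunteers, 500, seed=1)
print(len(sample), sample[:5])
```

Only the starting point is random; once it is fixed, the rest of the sample is determined by the interval.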
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Pictorial Representation 
2. Systematic Sampling

Advantages:
* Moderate cost; moderate usage
* Simple to draw sample
* Easy to verify

Disadvantages:
* Periodic ordering required
3. Stratified Sampling

* Population is divided into 2 or more groups
called strata

* Subsamples are randomly selected from each
stratum
3. Stratified Sampling

> Stratified sampling is used when the researcher
wants to understand the existing relationship
between two groups.

> The researcher can represent even the smallest
sub-group in the population.
3. Stratified Sampling - Example

For example, one might divide a population of adults
into subgroups (strata) by age, like:

18-29,
30-39,
40-49,
50-59, and
60 and above.
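
A rough sketch of the age-band example follows. The population is randomly generated, the upper limit of 120 for the last band and the choice of 20 people per stratum are assumptions made only for illustration:

```python
import random
from collections import defaultdict

# Age bands from the example; 120 is an assumed cap for "60 and above"
AGE_BANDS = [(18, 29), (30, 39), (40, 49), (50, 59), (60, 120)]

def band_of(age):
    """Return the (low, high) stratum that contains the given age."""
    for low, high in AGE_BANDS:
        if low <= age <= high:
            return (low, high)
    raise ValueError(f"age {age} is outside every stratum")

def stratified_sample(people, per_stratum, seed=None):
    """Group people into strata, then draw a simple random sample in each."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for person_id, age in people:
        strata[band_of(age)].append((person_id, age))
    return {
        band: rng.sample(members, min(per_stratum, len(members)))
        for band, members in strata.items()
    }

# Hypothetical population: 1000 adults with random ages
gen = random.Random(0)
people = [(i, gen.randint(18, 90)) for i in range(1000)]
sample = stratified_sample(people, per_stratum=20, seed=0)
for band in AGE_BANDS:
    print(band, len(sample[band]))
```

Drawing the same number from each stratum guarantees that small groups are represented; proportional allocation would instead need the stratum proportions, which is the information requirement listed among the disadvantages.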
Advantages:
* Assures representation of all groups in
sample population
* Characteristics of each stratum can be
estimated and comparisons made

Disadvantages:
* Requires accurate information on
proportions of each stratum
* Stratified lists costly to prepare
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4. Cluster Sampling 


Cluster sampling is a probability sampling method 
in which you divide a population into clusters, such 
as districts or schools, and then randomly select 
some of these clusters as your sample. 
4. Cluster Sampling

[Figure: the population is divided into clusters; 2 clusters are selected as the sample group.]
4. Cluster Sampling

An example of single-stage cluster sampling: an
NGO wants to create a sample of girls across five
neighbouring towns to provide education.

Using single-stage sampling, the NGO randomly
selects towns (clusters) to form a sample and
extends help to the girls deprived of education in
those towns.
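
The towns example might be sketched as below. The town names, the girl IDs and the choice of 2 clusters are hypothetical; the point is only that whole clusters are drawn at random and everyone inside a chosen cluster is included:

```python
import random

def single_stage_cluster_sample(clusters, n_clusters, seed=None):
    """Randomly pick whole clusters; every unit in a chosen cluster is sampled."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)
    return {name: list(clusters[name]) for name in chosen}

# Hypothetical data: five towns, each with 20 girl IDs
towns = {
    f"Town {label}": [f"{label}-{i}" for i in range(1, 21)]
    for label in "ABCDE"
}

sample = single_stage_cluster_sample(towns, n_clusters=2, seed=7)
for town, girls in sample.items():
    print(town, len(girls))
```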
4. Cluster Sampling

Advantages:
* Can estimate characteristics of both cluster
and population

Disadvantages:
* The cost to reach an element to sample is
very high
* Each stage in cluster sampling introduces
sampling error: the more stages there
are, the more error there tends to be
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In multistage sampling, or multistage cluster 
sampling, you draw a sample from a population 
using smaller and smaller groups (units) at each 
stage. 


It's often used to collect data from a large, 
geographically spread group of people in national 
surveys. 
EXERCISE

Some motivation required

A teacher puts students' names in a hat and chooses without looking to get a sample of 3 students.
What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
A large company surveys 100 employees by taking random samples of 10 managers and 90 non-managerial employees.

What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
Each law firm in one state registers its phone number with the state court system. An employee of the
state court system uses a computer to select 500 random registered phone numbers, and the law firms
associated with those numbers will be selected for an audit.

What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
A quality control worker at a factory selects the first 10 items she sees as her sample for the day.
What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
A support hotline asks callers to stay on the line after they have completed their call to respond to a short
survey.

What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
Employees at a website want to select a sample of their users to ask for a donation. They randomly select
one of the first 25 users each day and show them a message asking for a donation. They also show the
message to every 25th user from that point on.

What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
A principal orders t-shirts and wants to check some of them to make sure they were printed properly. She
randomly selects 2 of the 10 boxes of shirts and checks every shirt in those 2 boxes.

What type of sample is this?

Choose 1 answer:
(A) Simple random sample
(B) Stratified random sample
(C) Cluster random sample
(D) Systematic random sample
(E) None of the above
It's QUESTION TIME!!

THANK YOU
Minor Statistical Group Project: Sample Collection through 7-point Likert-based Questionnaire, then applying Correlation & Regression using SPSS, followed by Recommendations & Conclusions with appropriate webliography

- Choose two variables and collect data with a structured 7-point Likert-based questionnaire. (Remember & understand the concepts of correlation & regression)
- Explain sample type & frame hypothesis
- Apply statistical tool with the help of SPSS 21.0 software
- Understand & analyze the output generated and calculate the correlation & regression coefficients
- Analyze & evaluate the results of your analysis
- Create future scope of the project

Dr. Gitanjali Shrivastava, SLS, Pune

Home Activity

Design a questionnaire based on a 5- or 7-point Likert scale
Dr. Rensis Likert
(1903 - 1981)

The original idea for the Likert scale is found in Rensis Likert's
1932 article in Archives of Psychology titled "A Technique for the
Measurement of Attitudes". This idea was expanded in Likert's
1934 Journal of Social Psychology article titled "A Simple and
Reliable Method of Scoring the Thurstone Attitude Scales".
Example questionnaire (professional in appearance)

Codes: 1-strongly disagree, 2-disagree, 3-neutral, 4-agree, 5-strongly agree

(ix) I am comfortable with complex/advanced search
(x) I am comfortable with quick search
(i) I am comfortable when using it
Pictorial Concept

Population

Sample: a subset of the population.

Notation

Population: parameter (Greek)
What is a Variable?

* A characteristic, number, or quantity that
increases or decreases over time, or takes
different values in different situations.

* Something which is not fixed.
Types of Variables

VARIABLES: INDEPENDENT, DEPENDENT, CONTROLLED
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* The "independent variables" represent the 
inputs or causes, or are tested to see if they 
are the cause. 


— What is tested 
— What is manipulated 
* The dependent variable is something that might be
affected by the change in the independent variable

- What is observed
- What is measured
- The data collected during the investigation
Controlled Variables

* A controlled variable is a variable that is not
changed

- Also called constants
- Allow for a "fair test"
Example: Independent Variable

* Ages of the students

- Different ages were tested by the scientist

Example: Dependent Variable

* The time, factors, consequences

- The time was observed and measured by
the scientist

Example: Controlled Variable

* Same puzzle

- All of the participants were tested with the
same puzzle.

- It would not have been a fair test if some
had an easy 30-piece puzzle and some had a
harder 500-piece puzzle.
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* Independent variable — temperature of water 


* Dependent variable — time to cook a vegetable 


* Controlled variable — type of vegetable 
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* Independent variable — depth of the water 


* Dependent variable — temperature 


* Controlled variable — thermometer 
> A hypothesis is a supposition or explanation (theory) that is
provisionally accepted in order to interpret certain
events or phenomena, and to provide guidance for
further investigation.

> A hypothesis may be proven correct or wrong,
and must be capable of refutation.

> If it remains unrefuted by facts, it is said to be
verified or corroborated.
* The goal in Hypothesis Testing is to analyze a
sample in an attempt to distinguish between
population characteristics that are likely to
occur and population characteristics that are
unlikely to occur.
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Pictorial Representation 
* Elements of a hypothesis test:

- Null hypothesis: statement regarding the
value(s) of unknown parameter(s). Typically
will imply no association between explanatory
and response variables in our applications (will
always contain an equality)

- Alternative hypothesis: statement
contradictory to the null hypothesis (will
always contain an inequality)
A solid question: do dogs live longer than cats?

Alternative hypothesis: dogs live longer than cats

Null hypothesis: dogs don't live longer than cats
Null Hypothesis vs. Alternative

Null Hypothesis
* Statement about the value of a population parameter
* Represented by H0
* Always stated as an equality

Alternative Hypothesis
* Statement about the value of a population parameter that must be true if the null hypothesis is false
* Represented by H1
* Stated in one of three forms: >, <, ≠
NULL HYPOTHESIS EXAMPLES

THE NULL HYPOTHESIS ASSUMES THERE IS NO RELATIONSHIP BETWEEN TWO VARIABLES
AND THAT CONTROLLING ONE VARIABLE HAS NO EFFECT ON THE OTHER.
* α is the probability of Type I error

* β is the probability of Type II error

* The experimenters have the freedom to set
the α-level for a particular hypothesis test.
That level is called the level of significance
for the test.
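
To make the meaning of α concrete, here is a small simulation sketch. It assumes a two-tailed z-test with known σ = 1 on samples drawn from a population where the null hypothesis is actually true; the sample size, number of trials and the function name are all illustrative choices, not part of the slides:

```python
import random
import statistics

def type_one_error_rate(trials=2000, n=30, z_crit=1.96, seed=0):
    """Fraction of samples that wrongly reject a true H0: mu = 0."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]   # H0 is true here
        z = statistics.fmean(sample) * n ** 0.5            # z = mean / (sigma / sqrt(n)), sigma = 1
        if abs(z) > z_crit:                                # two-tailed test at alpha = 0.05
            rejections += 1
    return rejections / trials

rate = type_one_error_rate()
print(rate)
```

The printed rate should land close to 0.05: with the critical value set for α = 0.05, about 5% of samples reject a null hypothesis that is true.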
Type I and Type II Errors

Decision | True state of nature: the null hypothesis is true | True state of nature: the null hypothesis is false
We decide to reject the null hypothesis | Type I error (rejecting a true null hypothesis), α | Correct decision
We fail to reject the null hypothesis | Correct decision | Type II error (failing to reject a false null hypothesis), β
Conclusion from statistical analysis vs. the true state of nature

The true state of nature | Accept the Null | Reject the Null
Null hypothesis is true | Correct conclusion | Type I error (reject a true null hypothesis)
Null hypothesis is false | Type II error (accept a false null hypothesis) | Correct conclusion
* Every hypothesis test ends with the
experimenters either

  * Rejecting the Null Hypothesis, or
  * Failing to Reject the Null Hypothesis

* As strange as it may seem, one never
accepts the Null Hypothesis.
The Seven Steps...

1) Describe in words the population
characteristic about which hypotheses are
to be tested

2) State the null hypothesis, H0

3) State the alternative hypothesis, H1 or Ha

4) Display the test statistic to be used

5) Identify the rejection region
   * Is it an upper, lower, or two-tailed test?
   * Determine the critical value associated
   with α, the level of significance of the test

6) Compute all the quantities in the test
statistic, and compute the test statistic itself

7) State the conclusion.
That is, decide whether to reject the null
hypothesis, H0, or fail to reject the null
hypothesis.

The conclusion depends on the level of
significance of the test. Also, remember to
state your result in the context of the specific
problem.
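
The seven steps can be walked through in code. The sketch below assumes an upper-tailed z-test with a known population standard deviation; the sample size, sample mean, σ and α are invented numbers used only to show the mechanics:

```python
from statistics import NormalDist

# Steps 1-3: the population mean July temperature (hypothetical setting)
mu0 = 74.5            # H0: mu = 74.5   vs   Ha: mu > 74.5 (upper-tailed)

# Assumed sample summary; none of these figures come from the slides
n = 36                # sample size
sample_mean = 75.3    # observed sample mean
sigma = 2.0           # population standard deviation, assumed known
alpha = 0.05          # level of significance

# Step 4: test statistic z = (x_bar - mu0) / (sigma / sqrt(n))
# Step 5: rejection region for an upper-tailed test is z > z_crit
z_crit = NormalDist().inv_cdf(1 - alpha)

# Step 6: compute the test statistic
z = (sample_mean - mu0) / (sigma / n ** 0.5)

# Step 7: state the conclusion
reject_h0 = z > z_crit
print(f"z = {z:.2f}, critical value = {z_crit:.3f}, reject H0: {reject_h0}")
```

With these made-up numbers the statistic falls in the rejection region, so the conclusion in context would be that the data support a higher average July temperature at the 5% level.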
EXERCISE

Some motivation required

> State the null and alternative hypotheses for each of
the following situations. (That is, identify the correct
number μ0 and write H0: μ = μ0 and the appropriate
analogous expression for Ha.)

* The average July temperature in a region historically
has been 74.5°F. Perhaps it is higher now.

* The average weight of a female airline passenger
with luggage was 145 pounds ten years ago. The FAA
believes it to be higher now.

* The average stipend for doctoral students in a
particular discipline at a state university is $14,756.
The department chairman believes that the national
average is higher.

* The average room rate in hotels in a certain region
is $82.53. A travel agent believes that the average in a
particular resort area is different.

* The average farm size in a predominately rural state
was 69.4 acres. The secretary of agriculture of that
* Answers

- H0: μ = 74.5 vs Ha: μ > 74.5
- H0: μ = 145 vs Ha: μ > 145
- H0: μ = 14756 vs Ha: μ > 14756
- H0: μ = 82.53 vs Ha: μ ≠ 82.53
- H0: μ = 69.4 vs Ha: μ < 69.4
How to Give Bibliography???

1. Author name.
2. Title of the publication.
3. Date of publication.
4. The place of publication of a book.
5. The publishing company of a book.
6. The volume number of a magazine.
7. The page number(s).
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Research Paper Prototype 


Sample Research Paper 
How to Give References???

* Bhatt, P.G., and Vyas, V.H. (2014). A Study on
Performance Evaluation of Selected Equity Mutual Funds
in India. International Journal of Multidisciplinary
Research, 3(6), 34-38.

* Bogle, J.C. (1992). Selecting Equity Mutual Funds. The
Journal of Portfolio Management, 18(2), 94-100.

* et al.
How to Give Webliography???

* http://hdl.handle.net/10603/206228

[Accessed on 1st June 2020 at 12:20 PM IST].
* All the references of the presentation are mentioned in the
reference section.

* If by chance, any reference is not quoted due to human
error, we owe our full respect and heartfelt gratitude to all
the esteemed authors / publishers / journals, so on and so
forth.
Measurement and Scaling

> A scale is basically a continuous spectrum or
series of categories. It has been defined as
any series of items that are arranged
progressively according to value or
magnitude, into which an item can be placed
according to its quantification.

[Figure: examples of nominal (e.g. Operations, Engineering), ordinal (e.g. 3rd, 4th) and interval (e.g. 100-199, 200-299, 300-399, 400-499) scales]
Types of Scales

1. Nominal Scale

> A nominal scale is the simplest of the four
scale types; the numbers or
letters assigned to objects serve as labels for
identification or classification.

> Example:
  * Males = 1, Females = 2
2. Ordinal Scale

> An ordinal scale is one that arranges objects
or alternatives according to their magnitude.

> Examples:
  * Career Opportunities = Moderate, Good, Excellent
  * Investment Climate = Bad, inadequate, fair, good, very good
  * Merit = A grade, B grade, C grade, D grade
A problem with ordinal scales is that the
difference between categories on the scale is
hard to quantify,

i.e., excellent is better than good, but how
much better is excellent?
3. Interval Scale

> An interval scale is a scale that not only
arranges objects or alternatives according to
their respective magnitudes, but also
distinguishes this ordered arrangement in
units of equal intervals.

> i.e. interval scales indicate order (as in ordinal
scales) and also the distance in the order.
Example - Interval Scale

> Examples:
  * Consumer Price Index
  * Temperature scale in Fahrenheit

NOTE: Interval scales allow comparisons of
the differences of magnitude (e.g. of
attitudes) but do not allow determinations
of the actual strength of the magnitude.
4. Ratio Scale

> A ratio scale is a scale that possesses absolute
rather than relative qualities and has an
absolute zero.

> Examples:
  * Money
  * Weight
  * Distance
Type of Scale | Numerical Operation | Descriptive Statistics
Nominal | Counting | Frequency in each category, percentage in each category, mode
Ordinal | Rank ordering | Median, range, percentile ranking
Interval | Arithmetic operations on intervals between numbers | Mean, standard deviation, variance
Ratio | Arithmetic operations on actual quantities | Geometric mean, coefficient of variation
Examples of Measurement Scales

Nominal-Level Data: Zip code; Gender (M, F); Eye color (blue, brown, green, hazel); Political affiliation; Major field (math, computers, etc.)
Ordinal-Level Data: Grade (A, B, C, D, F); Judging (1st place, 2nd place, etc.); Rating scale (poor, good, excellent); Ranking of tennis players
Interval-Level Data: SAT score; Temperature
Ratio-Level Data: Height; Weight; Time; Salary
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THANK YOU 
Correlation & Regression

* Correlation is a statistical term describing the
degree to which two variables move in
coordination with one another.

* If the two variables move in the same
direction, then those variables are said to
have a positive correlation.

* If they move in opposite directions, then they
have a negative correlation.
* Correlation coefficients are used to measure the
strength of the linear relationship between two
variables.

* A correlation coefficient greater than zero
indicates a positive relationship while a value
less than zero signifies a negative relationship.

* A value of zero indicates no linear relationship
between the two variables being compared.
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* Calculating the correlation coefficient is time- 
consuming, so data are often plugged into a 
calculator, computer, or statistics program to 
find the coefficient. 
* The correlation coefficient (ρ) is a measure
that determines the degree to which the
movement of two different variables is
associated.

* The most common correlation coefficient,
generated by the Pearson product-moment
correlation, is used to measure the linear
relationship between two variables.

* The possible range of values for the
correlation coefficient is -1.0 to 1.0. In other
words, the values cannot exceed 1.0 or be less
than -1.0.
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Example - 1 


Correlation and the Financial Markets 


* In the financial markets, the correlation 
coefficient is used to measure the correlation 
between two securities. 


* For example, when two stocks move in the 
same direction, the correlation coefficient is 
positive. Conversely, when two stocks move in 
opposite directions, the correlation coefficient 
is negative. 
* For example, suppose that the prices of coffee
and computers are observed and found to
have a correlation of +0.0008.

* This means that there is essentially no correlation, or
linear relationship, between the two variables.
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Correlation Coefficient 


Positive Correlation Negative Correlation No Correlation 
Positive Correlation

* A positive correlation (when the correlation
coefficient is greater than 0) signifies that
both variables move in the same direction.
When ρ is +1, it signifies that the two variables
being compared have a perfect positive
relationship; when one variable moves higher
or lower, the other variable moves in the same
direction in exact linear proportion.
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The value of fuel prices is directly related to the prices 
of airplane tickets, with a correlation coefficient of 
+0.95. 


Ho: There is no significant impact of fuel prices on the 
prices of airplane tickets. 


Ha: There is significant impact of fuel prices on the 
prices of airplane tickets. 
* The relationship between fuel prices and airfares has
a very strong positive correlation since the value is
close to +1.

* So, if fuel prices decrease, airfares also tend to
decrease, and if fuel prices increase, so do the
prices of airplane tickets.


Business Statistics By: Dr.Gitanjali Shrivastava 


Negative Correlation 


* A negative (inverse) correlation occurs when
the correlation coefficient is less than 0.

* This is an indication that the two variables move
in opposite directions.

* In short, any reading between 0 and -1
means that the two securities move in
opposite directions. When ρ is -1, the
two are said to be perfectly negatively
correlated.
* Suppose a study is conducted to assess the
relationship between outside temperature
and heating bills.

* The correlation coefficient is calculated to be -0.96.

* The study concludes that there is a negative
correlation between heating bills
and the outdoor temperature.

* This strong negative correlation signifies
that as the temperature outside decreases,
heating bills increase (and vice versa).
Lecture 4 : Correlation Formula

Calculating ρ or r

* The covariance of the two variables in
question must be calculated before the
correlation can be determined.

* Next, each variable's standard deviation is
required.

* The correlation coefficient is determined by
dividing the covariance by the product of the
two variables' standard deviations.
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Standard deviation is a measure of 
the dispersion of data from its average. 
Covariance is a measure of how two variables 
change together. 

The normalized version of the statistic is 
calculated by dividing covariance by the 
product of the two standard deviations. 


This is the correlation coefficient. 
* Karl Pearson's Coefficient of Correlation

* The Karl Pearson correlation coefficient
method is quantitative and offers a numerical
value to establish the intensity of the linear
relationship between X and Y. This
correlation coefficient is represented as 'r'.
* Cov = 9.6
* Var x = 16; std. dev = 4
* Var y = 9; std. dev = 3
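
Plugging these three figures into the rule "covariance divided by the product of the standard deviations" gives the coefficient directly. A minimal sketch, with variable names chosen only for illustration:

```python
import math

cov_xy = 9.6                  # covariance of x and y
var_x, var_y = 16.0, 9.0      # variances of x and y

std_x = math.sqrt(var_x)      # 4.0
std_y = math.sqrt(var_y)      # 3.0

r = cov_xy / (std_x * std_y)  # 9.6 / 12
print(round(r, 4))            # 0.8
```

So these numbers correspond to a correlation of about +0.8, a strong positive linear relationship.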
Formula: Direct Method

$$\text{Correlation} = \rho = \frac{\operatorname{cov}(x, y)}{\sigma_x \, \sigma_y}$$

$$r_{xy} = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum (x_i - \bar{x})^2 \, \sum (y_i - \bar{y})^2}}$$
Terminology

* r_xy: the correlation coefficient of the linear
relationship between the variables x and y
* x_i: the values of the x-variable in a sample
* x̄: the mean of the values of the x-variable
* y_i: the values of the y-variable in a sample
* ȳ: the mean of the values of the y-variable
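
The direct method translates almost line for line into code. The sketch below uses a tiny made-up data set (not from the slides) and a hypothetical function name:

```python
import math

def pearson_r_direct(xs, ys):
    """Pearson's r from deviations about the means (direct method)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    dx = [x - mean_x for x in xs]          # x_i - x_bar
    dy = [y - mean_y for y in ys]          # y_i - y_bar
    sum_dxdy = sum(a * b for a, b in zip(dx, dy))
    sum_dx2 = sum(a * a for a in dx)
    sum_dy2 = sum(b * b for b in dy)
    return sum_dxdy / math.sqrt(sum_dx2 * sum_dy2)

# Hypothetical data: deviations give sum(dx*dy) = 6, sum(dx^2) = 10, sum(dy^2) = 6
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r = pearson_r_direct(xs, ys)
print(round(r, 4))
```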
Shortcut Method

* The Karl Pearson Coefficient of Correlation
formula is expressed as:

$$r = \frac{n \sum xy - \left(\sum x\right)\left(\sum y\right)}{\sqrt{\left[n \sum x^2 - \left(\sum x\right)^2\right]\left[n \sum y^2 - \left(\sum y\right)^2\right]}}$$

In this formula,

x̄ is the mean of the X variable.

ȳ is the mean of the Y variable.
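
A sketch of the shortcut formula, which works from raw sums rather than deviations. The data are the same kind of small hypothetical set used for illustration, and the function name is an assumption:

```python
import math

def pearson_r_shortcut(xs, ys):
    """Pearson's r from raw sums: n, sum x, sum y, sum xy, sum x^2, sum y^2."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator

# Hypothetical data: n = 5, sum x = 15, sum y = 20, sum xy = 66, sum x^2 = 55, sum y^2 = 86
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r = pearson_r_shortcut(xs, ys)
print(round(r, 4))
```

Because no means or deviations are needed, this form is convenient for hand calculation, and it gives the same value as the deviation-based formula on the same data.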
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Step Deviation Method/Short Cut Method 
Direct Method

Q1. Calculate Pearson's coefficient of
correlation between advertisement and sales as
per the data given below, with the framing of the
hypothesis.

Advertisement 39 65 62 90 82 75 25 98 36 78
Sales         47 53 58 86 62 68 60 91 51 84
Step 1: Frame Hypothesis

* Ho: There is no significant impact of
advertisement cost on sales.

* Ha: There is significant impact of
advertisement cost on sales.
Step 2: Find ΣX & ΣY

Advertisement X   Sales Y
39                47
65                53
62                58
90                86
82                62
75                68
25                60
98                91
36                51
78                84
ΣX = 650          ΣY = 660
Step 3: Find out mean of X & Y

x̄ = 650/10 = 65

ȳ = 660/10 = 66
Step 4: Find out dx & dy

Advertisement X   dx = x - 65   Sales Y   dy = y - 66
39                -26           47        -19
65                0             53        -13
62                -3            58        -8
90                25            86        20
82                17            62        -4
75                10            68        2
25                -40           60        -6
98                33            91        25
36                -29           51        -15
78                13            84        18
Step 4: Find out dx² & dy²

Advertisement X   dx = x - 65   dx²    Sales Y   dy = y - 66   dy²
39                -26           676    47        -19           361
65                0             0      53        -13           169
62                -3            9      58        -8            64
90                25            625    86        20            400
82                17            289    62        -4            16
75                10            100    68        2             4
25                -40           1600   60        -6            36
98                33            1089   91        25            625
36                -29           841    51        -15           225
78                13            169    84        18            324
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Step 5: Find out dx:dy 


Sales Y dy=y-y |dy? dx.dy 
; 66 
39 -26 676 47 -19 361 494 


65 0 0 53 -13 169 0 
62 -3 9 58 -8 64 

90 25 625 86 20 400 

82 17 289 62 -4 16 

75 10 100 68 2 4 

25 -40 1600 60 -6 36 

98 33 1089 91 25 625 

36 -29 841 51 us 225 

78 13 169 84 18 324 


Business Statistics By: Dr.Gitanjali Shrivastava 


Step 5: Find out dx: dy 


Sales Y dy=y-y |dy? dx.dy 
; 66 
39 -26 676 47 -19 361 494 


65 0 0 53 -13 169 0 
62 -3 9 58 -8 64 24 
90 25 625 86 20 400 500 
82 17 289 62 -4 16 -68 
75 10 100 68 2 4 20 
25 -40 1600 60 -6 36 240 
98 33 1089 91 25 625 825 
36 29 841 51 = NS) 225 435 
78 13 169 84 18 324 234 
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Step 6: Find out }dx2,>dy2, *dx.dy 


Sales Y dy=y-y |dy? dx.dy 
; 66 
39 -26 676 47 -19 361 494 


65 0 0 53 -13 169 0 
62 -3 9 58 -8 64 24 
90 25 625 86 20 400 500 
82 17 289 62 -4 16 -68 
75 10 100 68 2 4 20 
25 -40 1600 60 -6 36 240 
98 33 1089 91 25 625 825 
36 29 841 51 = NS) 225 435 
78 13 169 84 18 324 234 
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Step 6: Find out }dx2,>dy2, *dx.dy 


Sales Y dy-y-y |dy? dx.dy 
; 66 
39 -26 676 47 -19 361 494 


65 0 0 53 -13 169 0 

62 S 9 58 -8 64 24 
90 25 625 86 20 400 500 
82 17 289 62 -4 16 -68 
75 10 100 68 2 4 20 
25 -40 1600 60 -6 36 240 
98 33 1089 91 25 625 825 
36 229 841 51 = NS) 225 435 
78 13 169 84 18 324 234 


5398 2224 2704 | 


Step 7: Find Pearson’s coefficient of correlation 


(= 2/04 
73.4710 x 47.1593 


r=+0./804 
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Step 8: Give your Decision | 


r = + 0.7804 


Ho: There is no significant impact of advertisement 
cost on sales. 


Ha: There is significant impact of advertisement cost 
on sales 
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M 


PE 


Its QUESTION TIME !! 
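As a cross-check on the direct method, the steps above can be sketched in Python. The advertisement figures are the X values as read from the worked table, and the function name `direct_method` is an illustrative assumption rather than anything defined in the lecture.

```python
import math

# X values as read from the worked table; Y values from the question.
advert = [39, 65, 62, 90, 82, 75, 25, 98, 36, 78]
sales = [47, 53, 58, 86, 62, 68, 60, 91, 51, 84]

def direct_method(xs, ys):
    """Return (sum dx^2, sum dy^2, sum dx*dy, r) using deviations from the true means."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    sum_dx2 = sum(d * d for d in dx)
    sum_dy2 = sum(d * d for d in dy)
    sum_dxdy = sum(p * q for p, q in zip(dx, dy))
    r = sum_dxdy / math.sqrt(sum_dx2 * sum_dy2)
    return sum_dx2, sum_dy2, sum_dxdy, r

sum_dx2, sum_dy2, sum_dxdy, r = direct_method(advert, sales)
print(sum_dx2, sum_dy2, sum_dxdy, round(r, 4))
```

The three sums should match the totals 5398, 2224 and 2704 from Step 6, and r rounds to 0.7804.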
Direct Method


Q2. Calculate Pearson's coefficient of correlation between online sales and expenses of the following firm from the data given below, and frame the hypothesis.

Sales ('000) (X)      50   50   55   60   65   65   65   60   60   50
Expenses ('000) (Y)   11   13   14   16   16   15   15   14   13   13

Step 1: Frame Hypothesis

* H0: There is no significant impact of online sales on expenses.

* Ha: There is a significant impact of online sales on expenses.

Step 2: Find ΣX and ΣY

ΣX = 580    ΣY = 140    (n = 10)

Step 3: Find out the means of X and Y

x̄ = 580/10 = 58

ȳ = 140/10 = 14

Steps 4 to 6: Find out dx, dy, dx², dy², dx·dy and their sums

Here dx = X - x̄ = X - 58 and dy = Y - ȳ = Y - 14.

   X     dx    dx²      Y     dy    dy²   dx·dy
  50     -8     64     11     -3      9      24
  50     -8     64     13     -1      1       8
  55     -3      9     14      0      0       0
  60      2      4     16      2      4       4
  65      7     49     16      2      4      14
  65      7     49     15      1      1       7
  65      7     49     15      1      1       7
  60      2      4     14      0      0       0
  60      2      4     13     -1      1      -2
  50     -8     64     13     -1      1       8

ΣX = 580    Σdx² = 360    ΣY = 140    Σdy² = 22    Σdx·dy = 70

Step 7: Find Pearson's coefficient of correlation

$$ r = \frac{\sum dx\,dy}{\sqrt{\sum dx^2}\,\sqrt{\sum dy^2}} = \frac{70}{18.97 \times 4.69} $$

r = +0.787

Step 8: Give your Decision

r = +0.787

H0: There is no significant impact of online sales on expenses.

Ha: There is a significant impact of online sales on expenses.
It's QUESTION TIME!!
Lecture 5 - Shortcut Method


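* The Karl Pearson Coefficient of Correlation formula is expressed as:

$$ r = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{\left[n\sum x^2 - (\sum x)^2\right]\left[n\sum y^2 - (\sum y)^2\right]}} $$

In this formula, n is the number of paired observations (x, y).

x̄ is the mean of the X variable.

ȳ is the mean of the Y variable.

The shortcut method avoids computing x̄ and ȳ: x and y are replaced by the deviations dx = X - A and dy = Y - B from assumed means A and B, which leaves r unchanged. Dividing numerator and denominator by n gives the form used in the worked steps:

$$ r = \frac{\sum dx\,dy - \frac{(\sum dx)(\sum dy)}{n}}{\sqrt{\left[\sum dx^2 - \frac{(\sum dx)^2}{n}\right]\left[\sum dy^2 - \frac{(\sum dy)^2}{n}\right]}} $$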
Step Deviation Method/Short Cut Method
Shortcut Method


Q1. Calculate Pearson's coefficient of correlation between fertilizer used and productivity of the following farm from the data given below, and frame the hypothesis.

Fertilizer used (X)                15   18   20   24   30   35   40   50
Productivity (Y, metric tonnes)    85   93   95  105  120  130  150  160

Step 1: Frame Hypothesis

* H0: There is no significant impact of fertilizer used on the productivity of the farm.

* Ha: There is a significant impact of fertilizer used on the productivity of the farm.

Step 2: Decide Assumed Mean

The deviations in the worked table correspond to an assumed mean of A = 29 for X and B = 119 for Y.

Steps 3 to 6: Find out dx, dy, dx², dy², dx·dy and their sums

Here dx = X - 29 and dy = Y - 119.

   X     dx    dx²      Y     dy    dy²   dx·dy
  15    -14    196     85    -34   1156     476
  18    -11    121     93    -26    676     286
  20     -9     81     95    -24    576     216
  24     -5     25    105    -14    196      70
  30      1      1    120      1      1       1
  35      6     36    130     11    121      66
  40     11    121    150     31    961     341
  50     21    441    160     41   1681     861

Total: Σdx = 0    Σdx² = 1022    Σdy = -14    Σdy² = 5368    Σdx·dy = 2317

Step 7: Find Pearson's coefficient of correlation

$$ r = \frac{\sum dx\,dy - \frac{(\sum dx)(\sum dy)}{n}}{\sqrt{\left[\sum dx^2 - \frac{(\sum dx)^2}{n}\right]\left[\sum dy^2 - \frac{(\sum dy)^2}{n}\right]}} = \frac{2317 - 0}{\sqrt{1022 \times 5343.5}} $$

Here 5343.5 = 5368 - (-14)²/8.

r = +0.9915

There is a strong degree of positive correlation.

Step 8: Give your Decision

r = +0.9915

H0: There is no significant impact of fertilizer used on the productivity of the farm.

Ha: There is a significant impact of fertilizer used on the productivity of the farm.

It's QUESTION TIME!!
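The shortcut calculation above can be sketched in Python to show that the choice of assumed mean does not change r. The function name `assumed_mean_r` is an illustrative assumption; the assumed means 29 and 119 are the ones implied by the deviations in the worked table, and 30 and 120 are an arbitrary second choice for comparison.

```python
import math

def assumed_mean_r(xs, ys, a, b):
    """Pearson's r from deviations about assumed means a (for X) and b (for Y)."""
    n = len(xs)
    dx = [x - a for x in xs]
    dy = [y - b for y in ys]
    sum_dx, sum_dy = sum(dx), sum(dy)
    sum_dx2 = sum(d * d for d in dx)
    sum_dy2 = sum(d * d for d in dy)
    sum_dxdy = sum(p * q for p, q in zip(dx, dy))
    numerator = sum_dxdy - sum_dx * sum_dy / n
    denominator = math.sqrt(
        (sum_dx2 - sum_dx ** 2 / n) * (sum_dy2 - sum_dy ** 2 / n)
    )
    return numerator / denominator

fertilizer = [15, 18, 20, 24, 30, 35, 40, 50]
productivity = [85, 93, 95, 105, 120, 130, 150, 160]

r_worked = assumed_mean_r(fertilizer, productivity, 29, 119)  # means from the table
r_other = assumed_mean_r(fertilizer, productivity, 30, 120)   # arbitrary alternative
print(round(r_worked, 4), round(r_other, 4))
```

Both calls should agree to within floating-point error, at about 0.9915, because the correction terms (Σdx)(Σdy)/n, (Σdx)²/n and (Σdy)²/n absorb any shift of origin.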
Step Deviation Method/Short Cut Method
Shortcut Method


Q2. Calculate Pearson's coefficient of correlation between online classes and performance of the students from the data given below, and frame the hypothesis.

Online classes (X)       1    3    5    7    9   11
Performance (Marks, Y)   2    4    6    8   10   12

Step 1: Frame Hypothesis

* H0: There is no significant impact of online classes on the performance of the students.

* Ha: There is a significant impact of online classes on the performance of the students.

Step 2: Decide Assumed Mean

The deviations in the worked table correspond to an assumed mean of A = 5 for X and B = 8 for Y.

Steps 3 to 6: Find out dx, dy, dx², dy², dx·dy and their sums

Here dx = X - 5 and dy = Y - 8.

   X     dx    dx²      Y     dy    dy²   dx·dy
   1     -4     16      2     -6     36      24
   3     -2      4      4     -4     16       8
   5      0      0      6     -2      4       0
   7      2      4      8      0      0       0
   9      4     16     10      2      4       8
  11      6     36     12      4     16      24

Total: Σdx = 6    Σdx² = 76    Σdy = -6    Σdy² = 76    Σdx·dy = 64

Step 7: Find Pearson's coefficient of correlation

$$ r = \frac{64 - \frac{(6)(-6)}{6}}{\sqrt{\left(76 - \frac{6^2}{6}\right)\left(76 - \frac{(-6)^2}{6}\right)}} = \frac{64 - (-6)}{\sqrt{(76 - 6)(76 - 6)}} = \frac{70}{70} $$

r = +1

There is perfect positive correlation.

Step 8: Give your Decision

r = +1

H0: There is no significant impact of online classes on the performance of the students.

Ha: There is a significant impact of online classes on the performance of the students.
It's QUESTION TIME!!
THANK YOU
Spearman's Rank Correlation


The rank correlation is a non-parametric statistical procedure.

It is used to determine the bivariate correlation of two attributes measured on at least an ordinal scale, by comparing two ranking sequences with each other.

To determine the size of the correlation, it is necessary to calculate Spearman's or Kendall's tau rank correlation coefficient.
Rank Correlation - Formula


$$ \rho = 1 - \frac{6\sum d_i^2}{n(n^2 - 1)} $$

ρ = Spearman's rank correlation coefficient

d_i = difference between the two ranks of each observation

n = number of observations
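The formula can be sketched in Python as below. The function name `spearman_rho` and the two five-item rankings are illustrative assumptions, and the sketch assumes that no ranks are tied.

```python
def spearman_rho(rank_x, rank_y):
    """Spearman's rho from two lists of ranks (assumes no tied ranks)."""
    n = len(rank_x)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

# Illustrative rankings only (not from the lecture).
same_order = spearman_rho([1, 2, 3, 4, 5], [1, 2, 3, 4, 5])      # identical rankings
reverse_order = spearman_rho([1, 2, 3, 4, 5], [5, 4, 3, 2, 1])   # opposite rankings
print(same_order, reverse_order)
```

Identical rankings give Σd² = 0 and so ρ = +1; fully reversed rankings of five items give Σd² = 40 and ρ = 1 - 240/120 = -1.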
Q1. Spearman's Rank Correlation


In a quiz competition, the two judges Mr. Sam and Ms. Joe have given the following marks to law school students.

Sam (X)   60   34   40   50   45   41   22   43   42   66   64   46
Joe (Y)   75   32   34   40   45   33   12   30   36   72   41   57

Apply Spearman's rank correlation and obtain the result.

Step 1: Obtain the Ranks of X & Y

Step 2: Obtain the difference of Ranks d

Step 3: Obtain d²

Step 4: Obtain Σd²

Judge Sam X    R1    Judge Joe Y    R2    d = R1 - R2    d²
     60         3         75         1        +2          4
     34        11         32        10        +1          1
     40        10         34         8        +2          4
     50         4         40         6        -2          4
     45         6         45         4        +2          4
     41         9         33         9         0          0
     22        12         12        12         0          0
     43         7         30        11        -4         16
     42         8         36         7        +1          1
     66         1         72         2        -1          1
     64         2         41         5        -3          9
     46         5         57         3        +2          4

n = 12    Σd² = 48

Step 5: Put values in Formula

$$ \rho = 1 - \frac{6\sum d^2}{n(n^2 - 1)} = 1 - \frac{6(48)}{12(144 - 1)} $$

ρ = 0.832
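The ranking and the substitution into the formula can be sketched in Python. The helper names `ranks_desc` and `spearman_from_marks` are illustrative assumptions, and the ranking helper only works when no marks are tied, which holds for both judges here.

```python
sam = [60, 34, 40, 50, 45, 41, 22, 43, 42, 66, 64, 46]
joe = [75, 32, 34, 40, 45, 33, 12, 30, 36, 72, 41, 57]

def ranks_desc(values):
    """Rank 1 goes to the highest mark (assumes all marks are distinct)."""
    order = sorted(values, reverse=True)
    return [order.index(v) + 1 for v in values]

def spearman_from_marks(xs, ys):
    """Return (sum of d^2, rho) for two lists of untied marks."""
    n = len(xs)
    rank_x, rank_y = ranks_desc(xs), ranks_desc(ys)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    rho = 1 - 6 * sum_d2 / (n * (n ** 2 - 1))
    return sum_d2, rho

sum_d2, rho = spearman_from_marks(sam, joe)
print(sum_d2, round(rho, 3))
```

With these marks Σd² is 48 and ρ rounds to 0.832, matching Step 5.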
Q2. Spearman's Rank Correlation


Marks assigned to 10 students by experts X & Y in a certain competitive test are as follows:

X   52   53   42   60   45   41   37   38   25   27
Y   65   68   43   38   77   48   35   30   25   50

Apply Spearman's rank correlation and obtain the result.

Step 1: Obtain the Ranks of X & Y

Step 2: Obtain the difference of Ranks d

Step 3: Obtain d²

Step 4: Obtain Σd²

   X     R1      Y     R2    d = R1 - R2    d²
  52      3     65      3         0          0
  53      2     68      2         0          0
  42      5     43      6        -1          1
  60      1     38      7        -6         36
  45      4     77      1        +3          9
  41      6     48      5        +1          1
  37      8     35      8         0          0
  38      7     30      9        -2          4
  25     10     25     10         0          0
  27      9     50      4        +5         25

n = 10    Σd² = 76

Step 5: Put values in Formula

$$ \rho = 1 - \frac{6\sum d^2}{n(n^2 - 1)} = 1 - \frac{6(76)}{10(100 - 1)} $$

ρ = 0.539
Q3. Spearman's Rank Correlation


The ranks of 15 students in 2 subjects, Statistics and APR, are given below; the 2 numbers within the brackets denote the ranks of the same student in the two subjects:

(1,10) (2,7) (3,2) (4,6) (5,4) (6,8) (7,3) (8,1) (9,11) (10,15) (11,9) (12,5) (13,14) (14,12) (15,13)

Step 1: Fill the Ranks of X & Y

Step 2: Obtain the difference of Ranks

Step 3: Obtain d²

Step 4: Obtain Σd²

  R1     R2    d = R1 - R2    d²
   1     10        -9         81
   2      7        -5         25
   3      2        +1          1
   4      6        -2          4
   5      4        +1          1
   6      8        -2          4
   7      3        +4         16
   8      1        +7         49
   9     11        -2          4
  10     15        -5         25
  11      9        +2          4
  12      5        +7         49
  13     14        -1          1
  14     12        +2          4
  15     13        +2          4

n = 15    Σd² = 272

Step 5: Put values in Formula

$$ \rho = 1 - \frac{6\sum d^2}{n(n^2 - 1)} = 1 - \frac{6(272)}{15(225 - 1)} $$

ρ = 0.514
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Spearman's Rank Correlation 


y 4 3 3 2 1 
Repeated Rank Correlation


* Sometimes characteristics are not measurable, but ranks may be given to individuals according to their qualities.

* In such situations rank correlation is used to know the association between two characteristics. When two or more individuals receive the same rank, the situation is called tied or repeated rank.
Repeated Rank Correlation - Formula


$$ \rho = 1 - \frac{6\left[\sum d_i^2 + \frac{1}{12}\sum_i \left(m_i^3 - m_i\right)\right]}{n(n^2 - 1)} $$

where m_i is the number of repetitions of the i-th rank.
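A minimal Python sketch of the tie-corrected formula is given below. The function name `tied_spearman_rho` and the numbers in the example are illustrative assumptions, not taken from the lecture.

```python
def tied_spearman_rho(sum_d2, n, tie_sizes):
    """Spearman's rho with the (m^3 - m)/12 correction for each group of tied ranks.

    sum_d2    -- sum of squared rank differences
    n         -- number of observations
    tie_sizes -- one entry m per group of repeated values, across both variables
    """
    correction = sum((m ** 3 - m) / 12 for m in tie_sizes)
    return 1 - 6 * (sum_d2 + correction) / (n * (n ** 2 - 1))

# Illustrative numbers only: 5 observations, sum d^2 = 10, one pair of tied values.
with_tie = tied_spearman_rho(10, 5, [2])
# With no ties the correction vanishes and the ordinary formula is recovered.
no_tie = tied_spearman_rho(40, 5, [])
print(with_tie, no_tie)
```

In the first call the correction is (2³ - 2)/12 = 0.5, so ρ = 1 - 6(10.5)/120 = 0.475.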
Repeated Rank - Decision


0.01 to 0.19: "Very Weak Agreement"; (-0.01) to (-0.19): "Very Weak Disagreement"
Q1. Repeated Rank Correlation


Compute the rank correlation coefficient for the following data of the marks obtained by 8 students in Commerce and Mathematics.

Marks in Commerce (X)      15   20   28   12   40   60   20   80
Marks in Mathematics (Y)   40   30   50   30   20   10   30   60

Step 1: Obtain the Ranks of X & Y

Step 2: Obtain the difference of Ranks d

Step 3: Obtain d²

Step 4: Obtain Σd²

   X     R1      Y     R2    d = R1 - R2     d²
  15     2      40      6        -4          16
  20     3.5    30      4        -0.5         0.25
  28     5      50      7        -2           4
  12     1      30      4        -3           9
  40     6      20      2         4          16
  60     7      10      1         6          36
  20     3.5    30      4        -0.5         0.25
  80     8      60      8         0           0

n = 8    Σd² = 81.5

Step 5: Put values in Formula

Repetitions of ranks

* In Commerce (X), 20 is repeated two times, corresponding to ranks 3 and 4. Therefore, 3.5 is assigned for ranks 3 and 4, with m1 = 2.

* In Mathematics (Y), 30 is repeated three times, corresponding to ranks 3, 4 and 5. Therefore, 4 is assigned for ranks 3, 4 and 5, with m2 = 3.

$$ \rho = 1 - \frac{6\left[81.5 + \frac{1}{12}(2^3 - 2) + \frac{1}{12}(3^3 - 3)\right]}{8(8^2 - 1)} = 1 - \frac{6[81.5 + 0.5 + 2]}{504} = 1 - \frac{504}{504} = 0 $$

Step 6: Give Decision

* Interpretation: Marks in Commerce and Mathematics are uncorrelated.
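The averaging of tied ranks and the correction term can be sketched in Python using the Commerce and Mathematics marks. The helper names `average_ranks`, `tie_sizes` and `repeated_rank_rho` are illustrative assumptions; ranks are assigned in ascending order, as in the worked table.

```python
from collections import Counter

def average_ranks(values):
    """Ascending ranks; tied values share the mean of the positions they occupy."""
    ordered = sorted(values)
    ranks = []
    for v in values:
        first = ordered.index(v) + 1      # first position held by this value
        count = ordered.count(v)          # how many times the value repeats
        ranks.append(first + (count - 1) / 2)
    return ranks

def tie_sizes(values):
    """Number of repetitions m for each value that occurs more than once."""
    return [c for c in Counter(values).values() if c > 1]

def repeated_rank_rho(xs, ys):
    n = len(xs)
    rank_x, rank_y = average_ranks(xs), average_ranks(ys)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    correction = sum((m ** 3 - m) / 12 for m in tie_sizes(xs) + tie_sizes(ys))
    return 1 - 6 * (sum_d2 + correction) / (n * (n ** 2 - 1))

commerce = [15, 20, 28, 12, 40, 60, 20, 80]
maths = [40, 30, 50, 30, 20, 10, 30, 60]
print(average_ranks(commerce), average_ranks(maths))
print(repeated_rank_rho(commerce, maths))
```

The two tied Commerce marks receive rank 3.5, the three tied Mathematics marks receive rank 4, and the coefficient comes out as 0, in line with the decision above.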
Obtain the Rank correlation coefficient from the
following data:

X   68   64   75   50   64   80   75   40   55   64
Y   62   58   68   45   81   60   68   48   50   70

Step 1: Obtain the Ranks of X & Y

Step 2: Obtain the difference of Ranks d

Step 3: Obtain d²

Step 4: Obtain Σd²

   X     R1      Y     R2    d = R1 - R2    d²
  68     4      62      5        -1          1
  64     6      58      7        -1          1
  75     2.5    68      3.5      -1          1
  50     9      45     10        -1          1
  64     6      81      1        +5         25
  80     1      60      6        -5         25
  75     2.5    68      3.5      -1          1
  40    10      48      9        +1          1
  55     8      50      8         0          0
  64     6      70      2        +4         16

n = 10    Σd² = 72

Step 5: Put values in Formula

Repetitions of ranks

* In (X), 75 is repeated two times, corresponding to ranks 2 and 3. Therefore, 2.5 is assigned for ranks 2 and 3, with m1 = 2. The next repeated value is 64, which occupies ranks 5, 6 and 7 and so gets the average rank 6, with m2 = 3.

* In (Y), 68 is repeated two times, corresponding to ranks 3 and 4. Therefore, 3.5 is assigned for ranks 3 and 4, with m3 = 2.

$$ \rho = 1 - \frac{6\left[72 + \frac{1}{12}(2^3 - 2) + \frac{1}{12}(3^3 - 3) + \frac{1}{12}(2^3 - 2)\right]}{10(100 - 1)} = 1 - \frac{6(72 + 0.5 + 2 + 0.5)}{990} $$

ρ = 0.545

Step 6: Give Decision

* Interpretation: Moderate type of correlation
Q3. Repeated Rank Correlation


Obtain the Rank correlation coefficient from the following data:

X   57   16   24   65   16   16    9   40   48   33
Y   16    6    9   20    4   15    6   24   13   13

Step 1: Obtain the Ranks of X & Y

Step 2: Obtain the difference of Ranks d

Step 3: Obtain d²

Step 4: Obtain Σd²

   X     R1      Y     R2    d = R1 - R2     d²
  57     2      16      3        -1           1
  16     8       6      8.5      -0.5         0.25
  24     6       9      7        -1           1
  65     1      20      2        -1           1
  16     8       4     10        -2           4
  16     8      15      4         4          16
   9    10       6      8.5       1.5         2.25
  40     4      24      1         3           9
  48     3      13      5.5      -2.5         6.25
  33     5      13      5.5      -0.5         0.25

n = 10    Σd² = 41

Step 5: Put values in Formula

Repetitions of ranks

* In (X), 16 is repeated three times, with m1 = 3.

* In (Y), 13 and 6 are each repeated two times, with m2 = 2 and m3 = 2.

$$ \rho = 1 - \frac{6\left[41 + \frac{1}{12}(3^3 - 3) + \frac{1}{12}(2^3 - 2) + \frac{1}{12}(2^3 - 2)\right]}{10(100 - 1)} = 1 - \frac{6(41 + 2 + 0.5 + 0.5)}{990} $$

ρ = +0.733

Step 6: Give Decision

* Interpretation: Moderate type of correlation
Regression Analysis


* Regression analysis allows researchers to 
assess the strength of the relationship 
between an outcome (the dependent 
variable) and several predictor variables. 
Regression Analysis - Concept


* Regression analysis is a powerful statistical 
method that allows you to examine the 
relationship between two or more variables of 
interest. 


* While there are many types of regression 
analysis, at their core they all examine the 
influence of one or more independent 
variables on a dependent variable. 




Regression Analysis Features 


* Regression analysis is a set of statistical 
methods used for the estimation of 
relationships between a dependent variable 
and one or more independent variables. 


* It can be used to assess the strength of the relationship between variables and to model the future relationship between them.
Regression Analysis - Use


* The main uses of regression analysis are forecasting, time series modeling and finding the cause and effect relationship between variables.


* Regression has a wide range of real-life 
applications. 
Regression Analysis - Use


* Regression can be used to make predictions 
about the value of the dependent variable 
based on new values of the independent 
variables. 


* For example, a regression can be used to 
predict the price of a house based on its 
square footage, number of bedrooms, 
location, etc 
Regression Analysis - Types


* The two basic types of regression are: 


1.Simple linear regression 
2.Multiple linear regression 


There are also non-linear regression methods for more complicated data and analysis.
Regression Equation


* A linear regression line has an equation of the 
form: 


Y = a + bX,

where X is the explanatory variable and Y is the dependent variable. The slope of the line is b, and a is the intercept (the value of Y when X = 0).
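As a hedged illustration of how a and b might be estimated, the sketch below fits the line by ordinary least squares. The lecture does not specify an estimation method here, so least squares, the function name `fit_line` and the four data points are all assumptions made for this example.

```python
def fit_line(xs, ys):
    """Least-squares estimates (a, b) for the line Y = a + bX."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Slope: sum of products of deviations over sum of squared X deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b = sxy / sxx
    # Intercept: the fitted line passes through the point of means.
    a = mean_y - b * mean_x
    return a, b

# Illustrative data only: points lying exactly on Y = 1 + 2X.
a, b = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(a, b)

# Predict Y for a new X value using the fitted equation.
predicted = a + b * 5
print(predicted)
```

For these points the deviations give a slope of 10/5 = 2 and an intercept of 6 - 2(2.5) = 1, so the prediction at X = 5 is 11.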
Regression Analysis - Applications


Applications include:

* Sales forecasting. 

* Customer lifetime value prediction. 
* Churn prediction. 

* Employee performance prediction. 
* Financial performance analysis. 

* Risk analysis and fraud detection. 


* Maintenance prediction. 




It's QUESTION TIME!!


Business Statistics SLS,Pune 


Business Statistics By: Dr.Gitanjali Shrivastava 


